# Scalability parameters and ConfigROM

# Scalability parameters

Note that the current source code has only been tested for the nv\_small configuration, and while the nv\_small configuration passes many tests, additionall coverage is necessary to get it to tapeout quality.

| Scalability Parameter                                                  | Description                                                                                                          | INT8 Large<br>Config(nv_large ) | INT8 Small Config(nv_small) |
|------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|---------------------------------|-----------------------------|
| NVDLA_FEATURE_D ATA_TYPE_BINARY /INT4/INT8/INT16 INT32/FP16/FP32 /FP64 | Identify the data type of input feature data                                                                         | INT8                            | INT8                        |
| NVDLA_WEIGHT_D ATA_TYPE_BINARY /INT4/INT8/INT16 INT32/FP16/FP32 /FP64  | ldentify the data type of input weight data                                                                          | INT8                            | INT8                        |
| NVDLA_WEIGHT_CO<br>MPRESSION_ENABL E                                   | Support of the feature of<br>weight compression.<br>Disable this can save area<br>in CBUF                            | YES                             | NO                          |
| NVDLA_WINOG<br>RAD_ENABLE                                              | Support of the optimization feature of weight compression. Disable this can save area of CSC/CMAC/CACC               | YES                             | NO                          |
| NVDLA_MAC_ATOMI<br>C_C_SIZE                                            | MAC atomic size of input channel number                                                                              | 64                              | 8                           |
| NVDLA_MAC_ATOMI<br>C_K_SIZE                                            | MAC atomic size of output kernel number                                                                              | 32                              | 8                           |
| NVDLA_MEMORY_A<br>TOMIC_SIZE                                           | Memory smallest access size, also the data cube is aligned with this size.  Note: the size is # of feature data type | 32                              | 8                           |
| NVDLA_BATCH_ENA BLE                                                    | Support of optimization feature of batch.  Disable this can save area in SDP/CACC/CSC/CD MA                          | YES                             | No                          |
| NVDLA_MAX_BATCH<br>_SIZE                                               | Maximum batch size, this will directly impact the local buffer size                                                  | 32                              | •                           |
| NVDLA_CBUF_BANK<br>_NUMBER                                             | Convolutional buffer bank number                                                                                     | 16                              | 32                          |
| NVDLA_CBUF_BANK<br>_WIDTH                                              | Convolutional buffer bank<br>data width                                                                              | 64B                             | 8B                          |
| NVDLA_CBUF_BANK<br>_DEPTH                                              | Convolutional buffer bank depth                                                                                      | 512                             | 512                         |

| Scalability Parameter                          | Scalability Parameter Description                                         |      | INT8 Small Config(nv_small ) |
|------------------------------------------------|---------------------------------------------------------------------------|------|------------------------------|
| NVDLA_SECONDARY _MEMIF_ENABLE                  | Support the secondary<br>memory interface<br>(SRAMIF)                     | Yes  | No                           |
| NVDLA_SDP_LUT_E<br>NABLE                       | SDP support Look-up-<br>Table for non-linear<br>function                  | Yes  | No                           |
| NVDLA_SDP_BS_E<br>NABLE                        | SDP support Bias/Scaling function                                         | Yes  | Yes                          |
| NVDLA_SDP_BN_E<br>NABLE                        | SDP support Batch-<br>Normaliza tion function                             | Yes  | Yes                          |
| NVDLA_SDP_EW_E<br>NABLE                        | SDP support Element-<br>wise-op eration function                          | Yes  | No                           |
| NVDLA_SDP_BS_TH<br>ROUGHPUT                    | Throughput of SDP<br>Bias/Scaling function                                | 16   | 1                            |
| NVDLA_SDP_BN_TH<br>ROUGHPUT                    | Throughput of SDP Batch-<br>Normaliza tion function                       | 16   | 1                            |
| NVDLA_SDP_EW_TH<br>ROUGHPUT                    | Throughput of SDP<br>Element-wise-op eration<br>function                  | 4    | •                            |
| NVDLA_BDMA_ENA BLE                             | Support the bridge DMA engine function                                    | Yes  | No                           |
| NVDLA_RUBIK_ENA BLE                            | Support the rubik engine function                                         | Yes  | No                           |
| NVDLA_PDP_ENABL E                              | Support PDP engine function                                               | Yes  | Yes                          |
| NVDLA_PDP_THROU<br>GHPUT                       | Throughput of PDP engine function                                         | 8    | 1                            |
| NVDLA_CDP_ENAB LE                              | Support CDP engine function                                               | Yes  | Yes                          |
| NVDLA_CDP_THROU<br>GHPUT                       | Throughput of CDP engine function                                         | 8    | 1                            |
| NVDLA_PRIMARY_M<br>EMIF_MAX_BURST<br>_LENGTH   | Primary memory interface<br>maximum burst length<br>number                | 1    | 4                            |
| NVDLA_PRIMARY_M<br>EMIF_WIDTH                  | Primary memory interface data width                                       | 64B  | 8B                           |
| NVDLA_PRIMARY_M<br>EMIF_LATENCY                | Primary memory interface<br>data return latency cycles<br>for read access | 1200 | 50                           |
| NVDLA_SECONDARY<br>_MEMIF_MAX_BURS<br>T_LENGTH | Secondary memory<br>interface (SRAMIF)<br>maximum burst length<br>number  | 4    | •                            |
| NVDLA_SECONDARY<br>_MEMIF_WIDTH                | Secondary memory<br>interface (SRAMIF) data<br>width                      | 64B  | •                            |

| Scalability Parameter             | Scalability Parameter Description                                              |     | INT8 Small Config(nv_small) |  |
|-----------------------------------|--------------------------------------------------------------------------------|-----|-----------------------------|--|
| NVDLA_SECONDARY<br>_MEMIF_LATENCY | Secondary memory interface (SRAMIF) data return latency cycles for read access | 128 | •                           |  |
| NVDLA_MEMIF_ADD<br>RESS_WIDTH     | Address bit width for external memory interface                                | 64  | 32                          |  |

#### More Details

#### NVDLA\_MAC\_ATOMIC\_C\_SIZE and NVDLA\_MAC\_ATOMIC\_K\_SIZE

These two parameters affect determine the number of mac cells and number of multipliers in each mac cell. The total number of multipliers is NVDLA\_MAC\_ATOMI C\_C\_SIZE \* NVDLA\_MAC\_ATOMI C\_K\_SIZE. The number of kernels in one kernel group is NVDLA\_MAC\_ATOMI C\_K\_SIZE.

#### NVDLA\_MEMORY\_ATOMIC\_SIZE

This parameter determines the size of one atomic cube which is 1x1xNVDLA\_MEMORY\_ATOMIC\_SIZE. Feature, Weight, Bias, PReLU, Batch Normalization and Element-Wise data cubes are split into atomic cubes before loaded or stored into memory. In nvdlav1 and nv\_large configurations, size of atomic cbue is 1x1x32. In nv\_small configuration, it's 1x1x8. Line stride and surface stride should also align to NVDLA\_MEMORY\_ATOMIC\_SIZE.

# NVDLA\_CBUF\_BANK\_NUMBER, NVDLA\_CBUF\_BANK\_WIDTH and NVDLA\_CBUF\_BANK\_DEPTH

These parameters determine the size of cbuf. (NVDLA\_CBUF\_BANK\_NUMBER \* NVDLA\_CBUF\_BANK\_WIDTH \* NVDLA\_CBUF\_BANK\_DEPTH) For nv\_small, the size is 32\*8\*512 = 128KB. For nv\_small\_256, the size is 32\*32\*128 = 128KB.

#### Config nv\_small\_256

This config is created for higher convolution performance than nv\_small. Comparing with nv\_small, the difference is that CMAC has 256 multipliers, not 64. In this configuration NVDLA\_MAC\_ATOMIC\_C\_SIZE is 32 and NVDLA\_MAC\_ATOMI C\_K\_SIZE is 8. Accordingly, NVDLA\_CBUF\_BANK\_WIDTH is 32 and NVDLA\_CBUF\_BANK\_DEPTH is 128.

## Sub-unit identifier table

| Sub-unit Identifier | Sub-unit    |
|---------------------|-------------|
| Sub-unit identine   | Name        |
| 0x0000              | End of list |
| 0x0001              | GLB         |
| 0x0002              | CIF         |
| 0x0003              | CDMA        |
| 0x0004              | CBUF        |
| 0x0005              | CSC         |
| 0x0006              | CMAC        |
| 0x0007              | CACC        |
| 0x0008              | SDP_RDMA    |
| 0x0009              | SDP         |
| 0x000a              | PDP_RDMA    |
|                     |             |

| Sub-unit<br>Name |
|------------------|
| PDP              |
| CDP_RDMA         |
| CDP              |
| BDMA             |
| RUBIK            |
|                  |

#### Note:

- 1. CIF(ID=0x0002) can be configured to MCIF or SRAMIF.
- 2. There are two CMACs in nv\_small and nv\_large. (CMAC\_A and CMAC\_B)
- 3. CBUF doesn't have registers.

# Descriptors and payloads of sub-units in ConfigROM

The reg offset in bellow tables are the relative offset to the beginning of current descriptor.

#### **GLB**

| Reg offset (in Byte) | Reg name | Reg fields                     | Value in nv_small config | Value in nv_large config |
|----------------------|----------|--------------------------------|--------------------------|--------------------------|
|                      |          | Bits 0-15: unit id.            |                          |                          |
| 0x0                  | GLB_DESC | Bits 16-31: payload<br>length. | 0x00000001               | 0x00000001               |

#### CIF

|                             | Reg offset (in<br>Byte) | Reg name              | Reg fields                                                                                        | Value in<br>nv_smal l<br>config | Value in<br>nv_larg e<br>config (MCIF) | Value in<br>nv_larg e<br>config<br>(SRAMIF ) |
|-----------------------------|-------------------------|-----------------------|---------------------------------------------------------------------------------------------------|---------------------------------|----------------------------------------|----------------------------------------------|
|                             |                         |                       | Bits 0-15: unit id.                                                                               |                                 |                                        |                                              |
|                             | 0x0                     | CIF_DES C             | Bits 16-31:<br>payload<br>length.                                                                 | 0x00180 002                     | 0x00180 002                            | 0x00180 002                                  |
| Incompa tible capabil ities | 0x4                     | CIF_CAP<br>_INCOMP AT |                                                                                                   | 0x0                             | 0x0                                    | 0x0                                          |
| Compati ble capabil ities   | 0x8                     | CIF_CAP<br>_COMPAT    | bit 0: CIF_IS<br>_SRAM. Set to<br>1 if this CIF is<br>connect ed to<br>a separat e<br>SRAM block. | 0x0                             | 0x0                                    | 0x1                                          |
| Baselin e<br>paramet ers    | Охс                     | CIF_BAS<br>E_WIDTH    | bits 0-7: width<br>(max 256B)                                                                     | 0x8                             | 0x40                                   | 0x40                                         |

| Reg offset (in<br>Byte) | Reg name                            | Reg fields                                       | Value in<br>nv_smal l<br>config | Value in<br>nv_larg e<br>config (MCIF) | Value in<br>nv_larg e<br>config<br>(SRAMIF ) |
|-------------------------|-------------------------------------|--------------------------------------------------|---------------------------------|----------------------------------------|----------------------------------------------|
| 0x10                    | CIF_BAS<br>E_LATEN CY               | bits 0-15:<br>latency (max<br>65535 cycles)      | 0x32                            | 0x4b0                                  | 0x80                                         |
| 0x14                    | CIF_BAS E_<br>BURST_L<br>ENGTH_M AX | bits 0-7:<br>max_bur<br>st_leng th<br>(max 256B) | 0x4                             | 0x4                                    | 0x4                                          |
| 0x18                    | CIF_BAS<br>E_MEM_A<br>DDR_WID TH    | memory<br>interfa ce<br>address width            | 0x20                            | 0x40                                   | 0x40                                         |

## CDMA

|                             | Reg offset (in<br>Byte) | Reg name                        | Reg fields                                        | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|---------------------------------|---------------------------------------------------|--------------------------|--------------------------|
|                             |                         |                                 | Bits 0-15: unit id.                               |                          |                          |
|                             | 0x0                     | CDMA_DES C                      | Bits 16-31:<br>payload length.                    | 0x003400 03              | 0x003400 03              |
| Incompat ible capabili ties | 0x4                     | CDMA_CAP<br>_INCOMPA T          |                                                   | 0x0                      | 0x0                      |
|                             |                         |                                 | bit 0: WINOGRAD                                   |                          |                          |
|                             |                         |                                 | bit 1: MULTI_BA<br>TCH                            |                          |                          |
| Compatib le capabili ties   | 0x8                     | CDMA_CAP<br>_COMPAT             | bit 2: FEATURE_<br>COMPRESS ION                   | 0x10                     | 0x1b                     |
| capabili iles               |                         | _0017                           | bit 3: WEIGHT_C<br>OMPRESSI ON                    |                          |                          |
|                             |                         |                                 | bit 4: IMAGE_IN                                   |                          |                          |
|                             |                         |                                 | bit 31: 1'b0                                      |                          |                          |
| Baseline<br>paramete rs     | 0xc                     | CDMA_BAS<br>E_FEATUR<br>E_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                     |
|                             | 0x10                    | CDMA_BAS<br>E_WEIGHT<br>_TYPES  | Supporte d data<br>types of input<br>weight data  | 0x10                     | 0x10                     |
|                             | 0x14                    | CDMA_BAS<br>E_ATOMIC _C         | atomic_c                                          | 0x8                      | 0x40                     |
|                             | 0x18                    | CDMA_BAS<br>E_ATOMIC _K         | atomic_k                                          | 0x8                      | 0x20                     |
|                             | 0x1c                    | CDMA_BAS<br>E_ATOMIC _M         | atomic_m                                          | 0x8                      | 0x20                     |

|                               | , ,                     |                                       | -                                            |                          |                          |
|-------------------------------|-------------------------|---------------------------------------|----------------------------------------------|--------------------------|--------------------------|
|                               | Reg offset (in<br>Byte) | Reg name Reg fields                   |                                              | Value in nv_small config | Value in nv_large config |
|                               | 0x20                    | CDMA_BAS<br>E_CBUF_B<br>ANK_NUM       | cbuf_ban<br>k_number                         | 0x20                     | 0x10                     |
|                               | 0x24                    | CDMA_BAS<br>E_CBUF_B<br>ANK_WIDT H    | cbuf_ban<br>k_width                          | 0x8                      | 0x40                     |
|                               | 0x28                    | CDMA_BAS<br>E_CBUF_B<br>ANK_DEPT H    | cbuf_ban<br>k_depth                          | 0x200                    | 0x200                    |
| Capabili ties'<br>paramete rs | 0x2c                    | CDMA_MUL<br>TI_BATCH _MAX             | max_batc h                                   | 0x0                      | 0x20                     |
|                               | 0x30                    | CDMA_IMA<br>GE_IN_FO<br>RMATS_PA CKED | Supporte d<br>packed image<br>formats        | 0x0cfff0 01              | 0x0cfff0 01              |
|                               | 0x34                    | CDMA_IMA<br>GE_IN_F0<br>RMATS_SE MI   | Supporte d semi-<br>pla nar image<br>formats | 0x3                      | 0x3                      |

## **CBUF**

|                             | Reg offset (in<br>Byte) | Reg name                  | Reg fields                                       | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|---------------------------|--------------------------------------------------|--------------------------|--------------------------|
|                             | 0x0                     | CBUF_DES C                | Bits 0-15: unit id.  Bits 16-31: payload length. | 0x001800 04              | 0x001800 04              |
| Incompat ible capabili ties | 0x4                     | CBUF_CAP<br>_INCOMPA T    |                                                  | 0x0                      | 0x0                      |
| Compatib le capabili ties   | 0x8                     | CBUF_CAP<br>_COMPAT       |                                                  | 0x0                      | 0x0                      |
| Baseline<br>paramete rs     | 0xc                     | CBUF_BAS<br>E_BANK_N UM   | cbuf_ban<br>k_number                             | 0x20                     | 0x10                     |
|                             | 0x10                    | CBUF_BAS<br>E_BANK_WIDTH  | cbuf_ban<br>k_width                              | 0x8                      | 0x40                     |
|                             | 0x14                    | CBUF_BAS<br>E_BANK_D EPTH | cbuf_ban<br>k_depth                              | 0x200                    | 0x200                    |
|                             | 0x18                    | CBUF_BAS<br>E_CDMA_I D    | cdma_id                                          | 0x3                      | 0x4                      |

## CSC

|                             | Reg offset (in<br>Byte) | Reg name             | Reg fields                     | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|----------------------|--------------------------------|--------------------------|--------------------------|
|                             |                         |                      | Bits 0-15: unit id.            |                          |                          |
|                             | 0x0                     | CSC_DESC             | Bits 16-31:<br>payload length. | 0x003000 05              | 0x003000 05              |
| Incompat ible capabili ties | 0x4                     | CSC_CAP_<br>INCOMPAT |                                | 0x0                      | 0x0                      |

|                            | Reg offset (in<br>Byte) | Reg name                           | Reg fields                      | Value in nv_small config | Value in nv_large confi |
|----------------------------|-------------------------|------------------------------------|---------------------------------|--------------------------|-------------------------|
|                            |                         |                                    | bit 0: WINOGRAD                 |                          |                         |
|                            |                         |                                    | bit 1: MULTI_BA<br>TCH          |                          |                         |
| Compatib le capabili ties  | 0x8                     | CSC_CAP_<br>COMPAT                 | bit 2: FEATURE_<br>COMPRESS ION | 0x10                     | 0x1b                    |
|                            |                         |                                    | bit 3: WEIGHT_C<br>OMPRESSI ON  |                          |                         |
|                            |                         |                                    | bit 4: IMAGE_IN                 |                          |                         |
|                            |                         |                                    | bit 31: 1'b0                    |                          |                         |
| Baseline                   |                         | CSC_BASE                           | Supporte d data                 |                          |                         |
| paramete rs                | 0xc                     | _FEATURE                           | types of input                  | 0x10                     | 0x10                    |
| parameters                 |                         | _TYPES                             | feature data                    |                          |                         |
|                            |                         | CSC_BASE                           | Supporte d data                 |                          | 0x10                    |
|                            | 0x10                    | _WEIGHT_                           | types of input                  | 0x10                     |                         |
|                            |                         | TYPES                              | weight data                     |                          |                         |
|                            | 0x14                    | CSC_BASE                           | atomic_c                        | 0x8                      | 0x40                    |
|                            |                         | _ATOMIC_ C                         |                                 |                          |                         |
|                            | 0x18                    | CSC_BASE<br>_ATOMIC_ K             | atomic_k                        | 0x8                      | 0x20                    |
|                            | 0x1c                    | CSC_BASE<br>_ATOMIC_ M             | atomic_m                        | 0x8                      | 0x20                    |
|                            | 0x20                    | CSC_BASE<br>_CBUF_BA<br>NK_NUM     | cbuf_ban<br>k_number            | 0x20                     | 0x10                    |
|                            | 0x24                    | CSC_BASE<br>_CBUF_BA<br>NK_WIDTH   | cbuf_ban<br>k_width             | 0x8                      | 0x40                    |
|                            | 0x28                    | CSC_BASE<br>_CBUF_BA<br>NK_DEPGT H | cbuf_ban<br>k_depth             | 0x200                    | 0x200                   |
|                            | 0x2c                    | CSC_BASE<br>_CDMA_ID               | cdma_id                         | 0x3                      | 0x4                     |
| Capabili ties' paramete rs | 0x30                    | CSC_MULT<br>I_BATCH_ MAX           | max_batc h                      | 0x0                      | 0x20                    |

#### CMAC

There are two CMAC (CMAC\_A and CMAC\_B) in NVDLA nv\_small and nv\_large design. Their descriptors and payloads are same. They use different slots of address space.

| Reg offset (in | Pog nama | Reg fields | Value in nv_small | Value in        |
|----------------|----------|------------|-------------------|-----------------|
| Byte)          | Reg name | Reg lietus | config            | nv_large config |

|                             | Reg offset (in<br>Byte) | Reg name                        | Reg fields                                        | Value in nv_small config | Value in<br>nv_large config |  |
|-----------------------------|-------------------------|---------------------------------|---------------------------------------------------|--------------------------|-----------------------------|--|
|                             |                         |                                 | Bits 0-15: unit id.                               |                          |                             |  |
|                             | 0x0                     | CMAC_DES C                      | Bits 16-31:<br>payload length.                    | 0x001c00 06              | 0x001c00 06                 |  |
| Incompat ible capabili ties | 0x4                     | CMAC_CAP<br>_INCOMPA T          |                                                   | 0x0                      | 0x0                         |  |
| Compatib le capabili ties   | 0x8                     | CMAC_CAP<br>_COMPAT             | bit 0: WINOGRAD<br>bit 31: 1'b0                   | 0x0                      | 0x0                         |  |
| Baseline<br>paramete rs     | Охс                     | CMAC_BAS<br>E_FEATUR<br>E_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                        |  |
|                             | 0x14                    | CMAC_BAS<br>E_ATOMIC _C         | atomic_c                                          | 0x8                      | 0x40                        |  |
|                             | 0x18                    | CMAC_BAS<br>E_ATOMIC_K          | atomic_k                                          | 0x8                      | 0x20                        |  |
|                             | 0x1c                    | CMAC_BAS<br>E_CDMA_I D          | cdma_id                                           | 0x3                      | 0x4                         |  |

#### CACC

|                               | Reg offset (in<br>Byte) | Reg name                        | Reg fields                                        | Value in nv_small config | Value in nv_large config |
|-------------------------------|-------------------------|---------------------------------|---------------------------------------------------|--------------------------|--------------------------|
|                               |                         |                                 | Bits 0-15: unit id.                               |                          |                          |
|                               | 0x0                     | CACC_DES C                      | Bits 16-31:<br>payload length.                    | 0x002000 07              | 0x002000 07              |
| Incompat ible capabili ties   | 0x4                     | CACC_CAP<br>_INCOMPA T          |                                                   | 0x0                      | 0x0                      |
|                               |                         |                                 | bit 0: WINOGRAD                                   |                          |                          |
| Compatib le capabili ties     | 0x8                     | CACC_CAP<br>_COMPAT             | bit 1: MULTI_BA<br>TCH                            | 0x0                      | 0x3                      |
|                               |                         |                                 | bit 31: 1'b0                                      |                          |                          |
| VBaselin e<br>paramete rs     | Охс                     | CACC_BAS<br>E_FEATUR<br>E_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                     |
|                               | 0x10                    | CACC_BAS<br>E_WEIGHT<br>_TYPES  | Supporte d data<br>types of input<br>weight data  | 0x10                     | 0x10                     |
|                               | 0x14                    | CACC_BAS<br>E_ATOMIC _C         | atomic_k                                          | 0x8                      | 0x20                     |
|                               | 0x18                    | CACC_BAS<br>E_ATOMIC _K         | atomic_m                                          | 0x8                      | 0x20                     |
|                               | 0x1c                    | CACC_BAS<br>E_CDMA_I D          | cdma_id                                           | 0x3                      | 0x4                      |
| Capabili ties'<br>paramete rs | 0x20                    | CACC_MUL<br>TI_BATCH _MAX       | max_batc h                                        | 0x0                      | 0x20                     |

# SDP\_RDMA

|                              | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                       | Value in nv_small config | Value in nv_large config |
|------------------------------|-------------------------|--------------------------------|--------------------------------------------------|--------------------------|--------------------------|
|                              | 0x0                     | SDP_RDMA<br>_DESC              | Bits 0-15: unit id.  Bits 16-31: payload length. | 0x000e00 08              | 0x000e00 08              |
| Incompat ible capabili ties  | 0x4                     | SDP_RDMA<br>_CAP_INC<br>OMPAT  |                                                  | 0x0                      | 0x0                      |
| Compatib le<br>capabili ties | 0x8                     | SDP_RDMA<br>_CAP_COM PAT       |                                                  | 0x0                      | 0x0                      |
| Baseline<br>paramete rs      | Охс                     | SDP_RDMA<br>_BASE_AT<br>OMIC_M | atomic_m                                         | 0x8                      | 0x20                     |
|                              | 0xe                     | SDP_RDMA<br>_BASE_SD P_ID      | sdp_id (slot id of<br>correspo nding<br>sdp)     | 0x9                      | 0xa                      |

#### SDP

|                               | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                        | Value in nv_small config | Value in nv_large confi |
|-------------------------------|-------------------------|--------------------------------|---------------------------------------------------|--------------------------|-------------------------|
|                               | 0x0                     | SDP_DESC                       | Bits 0-15: unit id.  Bits 16-31:                  | 0x002000 09              | 0x002000 09             |
| Incompat ible capabili ties   | 0x4                     | SDP_CAP_<br>INCOMPAT           | payload length.                                   | 0x0                      | 0x0                     |
|                               |                         |                                | bit 0: WINOGRAD                                   |                          |                         |
|                               |                         |                                | bit 1: MULTI_BA<br>TCH                            |                          |                         |
|                               | 0x8 SDP_CAP_<br>COMPAT  |                                | bit 2: LUT                                        | 0x18                     | 0x3f                    |
| Compatib le capabili ties     |                         |                                | bit 3: BS                                         |                          |                         |
|                               |                         |                                | bit 4: BN                                         |                          |                         |
|                               |                         |                                | bit 5: EW                                         |                          |                         |
|                               |                         |                                | bit 31: 1'b0                                      |                          |                         |
| Baseline<br>paramete rs       | Охс                     | SDP_BASE<br>_FEATURE<br>_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                    |
|                               | 0x10                    | SDP_BASE<br>_CDMA_ID           | cdma_id                                           | 0x3                      | 0x4                     |
| Capabili ties'<br>paramete rs | 0x14                    | SDP_MULT<br>I_BATCH_ MAX       | max_batc h                                        | 0x0                      | 0x20                    |
|                               | 0x18                    | SDP_BS_THROU<br>GHPUT          | bs_throu ghput                                    | 0x1                      | 0x10                    |

| Reg offset (in<br>Byte) | Reg name                  | Reg fields     | Value in nv_small<br>config | Value in<br>nv_large config |
|-------------------------|---------------------------|----------------|-----------------------------|-----------------------------|
| 0x1c                    | SDP_<br>BN_THROU<br>GHPUT | bn_throu ghput | 0x1                         | 0x10                        |
| 0x20                    | SDP_<br>EW_THROU<br>GHPUT | ew_throu ghput | 0x0                         | 0x4                         |

# PDP\_RDMA

|                             | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                       | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|--------------------------------|--------------------------------------------------|--------------------------|--------------------------|
|                             | 0x0                     | PDP_<br>RDMA_DES C             | Bits 0-15: unit id.  Bits 16-31: payload length. | 0x000e00 0a              | 0x000e00 0a              |
| Incompat ible capabili ties | 0x4                     | PDP_<br>RDMA_CAP<br>_INCOMPA T |                                                  | 0x0                      | 0x0                      |
| Compatib le capabili ties   | 0x8                     | PDP_<br>RDMA_CAP<br>_COMPAT    |                                                  | 0x0                      | 0x0                      |
| Baseline<br>paramete rs     | 0xc                     | PDP_RDMA<br>_BASE_AT<br>OMIC_M | atomic_m                                         | 0x8                      | 0x20                     |
|                             | 0xe                     | PDP_RDMA<br>_BASE_PD P_ID      | pdp_id (slot id of<br>correspo nding<br>pdp)     | 0xb                      | 0xc                      |

#### PDP

|                             | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                        | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|--------------------------------|---------------------------------------------------|--------------------------|--------------------------|
|                             | 0x0                     | PDP_DESC                       | Bits 0-15: unit id.  Bits 16-31: payload length.  | 0x001000 0b              | 0x001000 0b              |
| Incompat ible capabili ties | 0x4                     | PDP_CAP_<br>INCOMPAT           |                                                   | 0x0                      | 0x0                      |
| Compatib le capabili ties   | 0x8                     | PDP_CAP_<br>COMPAT             |                                                   | 0x0                      | 0x0                      |
| Baseline<br>paramete rs     | 0xc                     | PDP_BASE<br>_FEATURE<br>_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                     |
|                             | 0x10                    | PDP_BASE<br>_THROUGH PUT       | throughp ut                                       | 0x1                      | 0x8                      |

# CDP\_RDMA

| Reg offset (in | Don      | Dog fields | Value in nv_small | Value in        |
|----------------|----------|------------|-------------------|-----------------|
| Byte)          | Reg name | Reg fields | config            | nv_large config |

|                             | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                   | Value in nv_small config | Value in nv_large config |
|-----------------------------|-------------------------|--------------------------------|----------------------------------------------|--------------------------|--------------------------|
|                             | 0x0                     | CDP_DESC                       | Bits 0-15: unit id.  Bits 16-31:             | 0x000e00 0c              | 0x000e00 0c              |
|                             |                         |                                | payload length.                              |                          |                          |
| Incompat ible capabili ties | 0x4                     | CDP_<br>RDMA_CAP<br>_INCOMPA T |                                              | 0x0                      | 0x0                      |
| Compatib le capabili ties   | 0x8                     | CDP_<br>RDMA_CAP<br>_COMPAT    |                                              | 0x0                      | 0x0                      |
| Baseline<br>paramete rs     | 0xc                     | CDP_RDMA<br>_BASE_AT<br>OMIC_M | atomic_m                                     | 0x8                      | 0x20                     |
|                             | 0xe                     | CDP_RDMA<br>_BASE_PD P_ID      | cdp_id (slot id of<br>correspo nding<br>cdp) | 0xd                      | 0xe                      |

#### CDP

|                             | Reg offset (in<br>Byte) | Reg name                       | Reg fields                                        | Value in nv_small config | Value in<br>nv_large config |
|-----------------------------|-------------------------|--------------------------------|---------------------------------------------------|--------------------------|-----------------------------|
|                             | 0x0                     | CDP_DESC                       | Bits 0-15: unit id.  Bits 16-31:  payload length. | 0x001000 0d              | 0x001000 0d                 |
| Incompat ible capabili ties | 0x4                     | CDP_CAP_<br>INCOMPAT           |                                                   | 0x0                      | 0x0                         |
| Compatib le capabili ties   | 0x8                     | CDP_CAP_<br>COMPAT             |                                                   | 0x0                      | 0x0                         |
| Baseline<br>paramete rs     | Охс                     | CDP_BASE<br>_FEATURE<br>_TYPES | Supporte d data<br>types of input<br>feature data | 0x10                     | 0x10                        |
|                             | 0x10                    | CDP_BASE<br>THROUGH PUT        | throughp ut                                       | 0x1                      | 0x8                         |

#### **BDMA**

| Reg offset (in Byte) | Reg name  | Reg fields                     | Value in nv_small config | Value in nv_large config |
|----------------------|-----------|--------------------------------|--------------------------|--------------------------|
|                      |           | Bits 0-15: unit id.            |                          |                          |
| 0x0                  | BDMA_DESC | Bits 16-31: payload<br>length. | 0x0004000e               | 0x0004000e               |

#### **RUBIK**

| Reg offset (in Byte) | Reg name | Reg fields  | Value in nv_small | Value in nv_large |
|----------------------|----------|-------------|-------------------|-------------------|
| rieg onset (in byte) | neg name | t Neg Helus | config            | config            |

| Reg offset (in Byte) | Reg name   | Reg fields                     | Value in nv_small config | Value in nv_large config |
|----------------------|------------|--------------------------------|--------------------------|--------------------------|
|                      |            | Bits 0-15: unit id.            |                          |                          |
| 0x0                  | RUBIK_DESC | Bits 16-31: payload<br>length. | 0x0004000f               | 0x0004000f               |

## Supported data types or weight types

Below table lists the fields of registers of supported data types or weight types in above sections

| Bit | Data type or Weight type |
|-----|--------------------------|
| 0   | Binary                   |
| 1   | INT4                     |
| 2   | UINT4                    |
| 3   | INT8                     |
| 4   | UINT8                    |
| 5   | INT16                    |
| 6   | UINT16                   |
| 7   | INT32                    |
| 8   | UINT32                   |
| 9   | FP16                     |
| 10  | FP32                     |
| 11  | FP64                     |
|     |                          |

## Supported packed image formats

Below table lists the fields of registers of supported packed image formats in above sections

| Bit | Image format   |
|-----|----------------|
| 0   | R8             |
| 1   | R10            |
| 2   | R12            |
| 3   | R16            |
| 4   | R16_I          |
| 5   | R16_F          |
| 6   | A16B16G16R16   |
| 7   | X16B16G16R16   |
| 8   | A16B16G16R16_F |
| 9   | A16Y16U16V16   |
| 10  | V16U16Y16A16   |
| 11  | A16Y16U16V16_F |
| 12  | A8B8G8R8       |
| 13  | A8R8G8B8       |
| 14  | B8G8R8A8       |
| 15  | R8G8B8A8       |
| 16  | X8B8G8R8       |
| 17  | X8R8G8B8       |
| 18  | B8G8R8X8       |
|     |                |

| Bit | Image format |
|-----|--------------|
| 19  | R8G8B8X8     |
| 20  | A2B10G10R10  |
| 21  | A2R10G10B10  |
| 22  | B10G10R10A2  |
| 23  | R10G10B10A2  |
| 24  | A2Y10U10V10  |
| 25  | V10U10Y10A2  |
| 26  | A8Y8U8V8     |
| 27  | V8U8Y8A8     |

## Supported semi-planar image formats

Below table lists the fields of registers of supported semi-planar image formats in above sections

| Bit | lmage format   |
|-----|----------------|
| 0   | Y8U8V8_N444    |
| 1   | Y8V8U8_N444    |
| 2   | Y10U10V10_N444 |
| 3   | Y10V10U10_N444 |
| 4   | Y12U12V12_N444 |
| 5   | Y12V12U12_N444 |
| 6   | Y16U16V16_N444 |
| 7   | Y16V16U16_N444 |

# Address space layout

In the address space layout, the order of sub-units is same as the order of the descriptors in Configuration ROM. The size of one slot is 4KB.

nv\_small:





68KB